Dependent nonparametric trees for dynamic hierarchical clustering: Supplementary material

نویسندگان

  • Avinava Dubey
  • Qirong Ho
  • Sinead Williamson
  • Eric P. Xing
چکیده

There exist a wide variety of distributions over trees with infinitely many nodes, including the nested Chinese restaurant process (Blei et al., 2004), the Dirichlet diffusion tree (Neal, 2003), and Kingman’s coalescent (Kingman, 1982). These models differ from the TSSBP in that data can only be associated with a leaf node, or equivalently a full path from root to leaf. We chose to base our clustering model on the TSSBP because, in many applications, it makes sense to associate data with internal nodes. For example, a document may be narrowly about Physics or Biology, or may be a more broad article on the sciences in general.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dependent nonparametric trees for dynamic hierarchical clustering

Hierarchical clustering methods offer an intuitive and powerful way to model a wide variety of data sets. However, the assumption of a fixed hierarchy is often overly restrictive when working with data generated over a period of time: We expect both the structure of our hierarchy, and the parameters of the clusters, to evolve with time. In this paper, we present a distribution over collections ...

متن کامل

Nonparametric K-Sample Tests via Dynamic Slicing

ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Nonparametric K-Sample Tests via Dynamic Slicing Bo Jiang, Chao Ye & Jun S. Liu To cite this article: Bo Jiang, Chao Ye & Jun S. Liu (2015) Nonparametric K-Sample Tests via Dynamic Slicing, Journal of the American Statistical Association, 110:510, 642-653, DOI: 10.1080/01621459.2014.920257 To link...

متن کامل

How networks change with time

MOTIVATION Biological networks change in response to genetic and environmental cues. Changes are reflected in the abundances of biomolecules, the composition of protein complexes and other descriptors of the biological state. Methods to infer the dynamic state of a cell would have great value for understanding how cells change over time to accomplish biological goals. RESULTS A new method pre...

متن کامل

Nonparametric Bayesian Data Analysis

We review the current state of nonparametric Bayesian inference. The discussion follows a list of important statistical inference problems, including density estimation, regression, survival analysis, hierarchical models and model validation. For each inference problem we review relevant nonparametric Bayesian models and approaches including Dirichlet process (DP) models and variations, Polya t...

متن کامل

Binary to Bushy: Bayesian Hierarchical Clustering with the Beta Coalescent

Discovering hierarchical regularities in data is a key problem in interacting with large datasets, modeling cognition, and encoding knowledge. A previous Bayesian solution—Kingman’s coalescent—provides a probabilistic model for data represented as a binary tree. Unfortunately, this is inappropriate for data better described by bushier trees. We generalize an existing belief propagation framewor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014